13 research outputs found
Deep Directional Statistics: Pose Estimation with Uncertainty Quantification
Modern deep learning systems successfully solve many perception tasks such as
object pose estimation when the input image is of high quality. However, in
challenging imaging conditions such as on low-resolution images or when the
image is corrupted by imaging artifacts, current systems degrade considerably
in accuracy. While a loss in performance is unavoidable, we would like our
models to quantify their uncertainty in order to achieve robustness against
images of varying quality. Probabilistic deep learning models combine the
expressive power of deep learning with uncertainty quantification. In this
paper, we propose a novel probabilistic deep learning model for the task of
angular regression. Our model uses von Mises distributions to predict a
distribution over object pose angle. Whereas a single von Mises distribution is
making strong assumptions about the shape of the distribution, we extend the
basic model to predict a mixture of von Mises distributions. We show how to
learn a mixture model using a finite and infinite number of mixture components.
Our model allows for likelihood-based training and efficient inference at test
time. We demonstrate on a number of challenging pose estimation datasets that
our model produces calibrated probability predictions and competitive or
superior point estimates compared to the current state-of-the-art
Robust and Efficient Deep Visual Learning
The past decade was marked by significant progress in the field of artificial intelligence and statistical learning. However, the most impressive of modern models come in the form of computationally expensive black boxes, with the majority of them lacking the ability to reason about the confidence of their predictions robustly. Being capable of quantifying model uncertainty and recognizing failure scenarios is crucial when it comes to incorporating them into complex decision-making pipelines, e.g. autonomous driving or medical image analysis systems. It is also important to maintain a low computational cost of these models.
In the present thesis, the aforementioned desired properties of robustness and efficiency of deep learning models are studied and developed in the three specific realms of computer vision. First, we investigate deep probabilistic models that allow uncertainty quantification, i.e. the models that "know what they do not know". Here, we propose a novel model for the task of angular regression that allows probabilistic object pose estimation from 2D images. We also showcase how the general deep density estimation paradigm can be adapted and utilized in two other real-world applications, ball trajectory prediction and brain imaging.
Next, we turn to the field of 3D shape analysis and rendering. We propose a method for efficient encoding of 3D point clouds, the type of data that is hard to handle with conventional learning algorithms due to its unordered nature. We show that simple neural networks that use the developed encoding as input can match the performance of state-of-the-art methods on various point cloud processing tasks while using orders of magnitude less floating-point operations.
Finally, we explore the emerging field of neural rendering and develop the framework that connects classic deformable 3D body models with modern image-to-image translation neural networks. This combination allows efficient photorealistic human avatar rendering in a controlled manner, with the possibility to control the camera flexibly and to change the body pose and shape appearance.
The thesis concludes with the discussion of the presented methods, including current limitations and future research directions
ResFields: Residual Neural Fields for Spatiotemporal Signals
Neural fields, a category of neural networks trained to represent
high-frequency signals, have gained significant attention in recent years due
to their impressive performance in modeling complex 3D data, especially large
neural signed distance (SDFs) or radiance fields (NeRFs) via a single
multi-layer perceptron (MLP). However, despite the power and simplicity of
representing signals with an MLP, these methods still face challenges when
modeling large and complex temporal signals due to the limited capacity of
MLPs. In this paper, we propose an effective approach to address this
limitation by incorporating temporal residual layers into neural fields, dubbed
ResFields, a novel class of networks specifically designed to effectively
represent complex temporal signals. We conduct a comprehensive analysis of the
properties of ResFields and propose a matrix factorization technique to reduce
the number of trainable parameters and enhance generalization capabilities.
Importantly, our formulation seamlessly integrates with existing techniques and
consistently improves results across various challenging tasks: 2D video
approximation, dynamic shape modeling via temporal SDFs, and dynamic NeRF
reconstruction. Lastly, we demonstrate the practical utility of ResFields by
showcasing its effectiveness in capturing dynamic 3D scenes from sparse sensory
inputs of a lightweight capture system.Comment: Project page and code at https://markomih.github.io/ResFields
Real Time Trajectory Prediction Using Deep Conditional Generative Models
Data driven methods for time series forecasting that quantify uncertainty
open new important possibilities for robot tasks with hard real time
constraints, allowing the robot system to make decisions that trade off between
reaction time and accuracy in the predictions. Despite the recent advances in
deep learning, it is still challenging to make long term accurate predictions
with the low latency required by real time robotic systems. In this paper, we
propose a deep conditional generative model for trajectory prediction that is
learned from a data set of collected trajectories. Our method uses encoder and
decoder deep networks that maps complete or partial trajectories to a Gaussian
distributed latent space and back, allowing for fast inference of the future
values of a trajectory given previous observations. The encoder and decoder
networks are trained using stochastic gradient variational Bayes. In the
experiments, we show that our model provides more accurate long term
predictions with a lower latency that popular models for trajectory forecasting
like recurrent neural networks or physical models based on differential
equations. Finally, we test our proposed approach in a robot table tennis
scenario to evaluate the performance of the proposed method in a robotic task
with hard real time constraints
ΠΠ½ΡΠΈΠΏΠ»ΠΎΡΠΊΠ°Ρ ΠΎΡΠ΅ΡΠΈΠΌΠΌΠ΅ΡΡΠΈΡΠ½Π°Ρ Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΡ Π½Π΅ΡΠΆΠΈΠΌΠ°Π΅ΠΌΠΎΠ³ΠΎ ΡΠ΅Π»Π° Π² ΡΡΠ»ΠΎΠ²ΠΈΡΡ ΠΏΠΎΠ»Π·ΡΡΠ΅ΡΡΠΈ
Flow of incompressible medium under varying gradient of pressure is considered. It is assumed that
medium exhibits nonlinear elastic and creep behavior. The theory of large strains based on transport
equations for the tensors of reversible and irreversible deformations is used for problem formulation.
Analytical and numerical methods are applied to solve the problemΠ Π°ΡΡΠΌΠ°ΡΡΠΈΠ²Π°Π΅ΡΡΡ ΡΠ΅ΡΠ΅Π½ΠΈΠ΅ Π½Π΅ΡΠΆΠΈΠΌΠ°Π΅ΠΌΠΎΠΉ ΡΡΠ΅Π΄Ρ Π² ΡΠΈΠ»ΠΈΠ½Π΄ΡΠΈΡΠ΅ΡΠΊΠΎΠΉ ΡΡΡΠ±Π΅ ΠΏΠΎΠ΄ Π΄Π΅ΠΉΡΡΠ²ΠΈΠ΅ΠΌ ΠΈΠ·ΠΌΠ΅Π½ΡΡ-
ΡΠ΅Π³ΠΎΡΡ ΠΏΠ΅ΡΠ΅ΠΏΠ°Π΄Π° Π΄Π°Π²Π»Π΅Π½ΠΈΡ. ΠΠ°ΡΠ΅ΡΠΈΠ°Π» ΠΏΡΠΎΡΠ²Π»ΡΠ΅Ρ Π½Π΅Π»ΠΈΠ½Π΅ΠΉΠ½ΡΠ΅ ΡΠΏΡΡΠ³ΠΈΠ΅ ΠΈ Π²ΡΠ·ΠΊΠΈΠ΅ ΡΠ²ΠΎΠΉΡΡΠ²Π°. ΠΠ°ΡΠ΅ΠΌΠ°-
ΡΠΈΡΠ΅ΡΠΊΠ°Ρ ΠΌΠΎΠ΄Π΅Π»Ρ ΡΡΡΠΎΠΈΡΡΡ Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΠ΅ΠΌ ΡΠ΅ΠΎΡΠΈΠΈ Π±ΠΎΠ»ΡΡΠΈΡ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΉ, ΠΎΡΠ½ΠΎΠ²Π°Π½Π½ΠΎΠΉ Π½Π° Π΄ΠΈΡΡΠ΅-
ΡΠ΅Π½ΡΠΈΠ°Π»ΡΠ½ΡΡ
ΡΡΠ°Π²Π½Π΅Π½ΠΈΡΡ
ΠΏΠ΅ΡΠ΅Π½ΠΎΡΠ° Π΄Π»Ρ ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΡ
ΠΈ Π½Π΅ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΡ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΉ. Π Π΅ΡΠ΅Π½ΠΈΠ΅ ΠΈΡΠ΅ΡΡΡ
Ρ ΠΏΠΎΠΌΠΎΡΡΡ Π°Π½Π°Π»ΠΈΡΠΈΡΠ΅ΡΠΊΠΈΡ
ΠΈ ΡΠΈΡΠ»Π΅Π½Π½ΡΡ
ΠΌΠ΅ΡΠΎΠ΄ΠΎ
ΠΠ½ΡΠΈΠΏΠ»ΠΎΡΠΊΠ°Ρ Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΡ ΡΠΏΡΠΎΡΠ½ΡΡΡΠ΅ΠΉΡΡ ΡΠΏΡΡΠ³ΠΎΠ²ΡΠ·ΠΊΠΎΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΎΠΉ ΡΡΠ΅Π΄Ρ
Deforming of hardening elastoviscoplastic medium under the action of variable pressure gradient is con-
sidered in this paper. The displacement vector of material points has only vertical component. Mathe-
matical model is based on the theory of large elastoplastic deformations. Differential transport equations
of tensors of reversible and irreversible deformations are formulated. Irreversible deformations are split
up into plastic and creep deformations. The solution is obtained with the use of analytical and numerical
methods. The influence of hardening and viscosity parameters on medium deformation is analyzed.Π ΡΠ°Π±ΠΎΡΠ΅ ΡΠ°ΡΡΠΌΠΎΡΡΠ΅Π½Π° Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΡ ΡΠΏΡΠΎΡΠ½ΡΡΡΠ΅ΠΉΡΡ ΡΠΏΡΡΠ³ΠΎΠ²ΡΠ·ΠΊΠΎΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΎΠΉ ΡΡΠ΅Π΄Ρ ΠΏΠΎΠ΄ Π΄Π΅ΠΉΡΡΠ²ΠΈΠ΅ΠΌ ΠΈΠ·ΠΌΠ΅Π½ΡΡΡΠ΅Π³ΠΎΡΡ ΠΏΠ΅ΡΠ΅ΠΏΠ°Π΄Π° Π΄Π°Π²Π»Π΅Π½ΠΈΡ. ΠΠ΅ΠΊΡΠΎΡ ΠΏΠ΅ΡΠ΅ΠΌΠ΅ΡΠ΅Π½ΠΈΠΉ ΡΠΎΡΠ΅ΠΊ ΡΡΠ΅Π΄Ρ ΠΈΠΌΠ΅Π΅Ρ ΡΠΎΠ»ΡΠΊΠΎ Π²Π΅ΡΡΠΈΠΊΠ°Π»ΡΠ½ΡΡ ΠΊΠΎΠΌΠΏΠΎΠ½Π΅Π½ΡΡ. ΠΠ°ΡΠ΅ΠΌΠ°ΡΠΈΡΠ΅ΡΠΊΠ°Ρ ΠΌΠΎΠ΄Π΅Π»Ρ ΠΎΡΠ½ΠΎΠ²Π°Π½Π° Π½Π° ΡΠ΅ΠΎΡΠΈΠΈ Π±ΠΎΠ»ΡΡΠΈΡ
ΡΠΏΡΡΠ³ΠΎΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΈΡ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΉ, Π² ΡΠ°ΠΌΠΊΠ°Ρ
ΠΊΠΎΡΠΎΡΠΎΠΉ ΡΡΠΎΡΠΌΡΠ»ΠΈΡΠΎΠ²Π°Π½Ρ Π΄ΠΈΡΡΠ΅ΡΠ΅Π½ΡΠΈΠ°Π»ΡΠ½ΡΠ΅ ΡΡΠ°Π²Π½Π΅Π½ΠΈΡ ΠΏΠ΅ΡΠ΅Π½ΠΎΡΠ° ΡΠ΅Π½Π·ΠΎΡΠΎΠ² ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΡ
ΠΈ Π½Π΅ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΡ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΉ. ΠΡΠΈ ΡΡΠΎΠΌ Π½Π΅ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΠ΅ Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΈ Π²ΠΊΠ»ΡΡΠ°ΡΡ Π² ΡΠ΅Π±Ρ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΈ ΠΏΠΎΠ»Π·ΡΡΠ΅ΡΡΠΈ ΠΈ ΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΈΠ΅ Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΈ. Π Π΅ΡΠ΅Π½ΠΈΠ΅ Π·Π°Π΄Π°ΡΠΈ ΠΏΠΎΠ»ΡΡΠ΅Π½ΠΎ Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΠ΅ΠΌ Π°Π½Π°Π»ΠΈΡΠΈΡΠ΅ΡΠΊΠΈΡ
ΠΈ ΡΠΈΡΠ»Π΅Π½Π½ΡΡ
ΠΌΠ΅ΡΠΎΠ΄ΠΎΠ². ΠΡΠΎΠ°Π½Π°Π»ΠΈΠ·ΠΈΡΠΎΠ²Π°Π½ΠΎ Π²Π»ΠΈΡΠ½ΠΈΠ΅ ΠΏΠ°ΡΠ°ΠΌΠ΅ΡΡΠΎΠ² ΡΠΏΡΠΎΡΠ½Π΅Π½ΠΈΡ ΠΈ
Π²ΡΠ·ΠΊΠΎΡΡΠΈ Π½Π° Π΄Π΅ΡΠΎΡΠΌΠΈΡΠΎΠ²Π°Π½ΠΈΠ΅ ΡΡΠ΅Π΄
ΠΠ½ΡΠΈΠΏΠ»ΠΎΡΠΊΠ°Ρ Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΡ ΡΠΏΡΠΎΡΠ½ΡΡΡΠ΅ΠΉΡΡ ΡΠΏΡΡΠ³ΠΎΠ²ΡΠ·ΠΊΠΎΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΎΠΉ ΡΡΠ΅Π΄Ρ
Deforming of hardening elastoviscoplastic medium under the action of variable pressure gradient is con-
sidered in this paper. The displacement vector of material points has only vertical component. Mathe-
matical model is based on the theory of large elastoplastic deformations. Differential transport equations
of tensors of reversible and irreversible deformations are formulated. Irreversible deformations are split
up into plastic and creep deformations. The solution is obtained with the use of analytical and numerical
methods. The influence of hardening and viscosity parameters on medium deformation is analyzed.Π ΡΠ°Π±ΠΎΡΠ΅ ΡΠ°ΡΡΠΌΠΎΡΡΠ΅Π½Π° Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΡ ΡΠΏΡΠΎΡΠ½ΡΡΡΠ΅ΠΉΡΡ ΡΠΏΡΡΠ³ΠΎΠ²ΡΠ·ΠΊΠΎΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΎΠΉ ΡΡΠ΅Π΄Ρ ΠΏΠΎΠ΄ Π΄Π΅ΠΉΡΡΠ²ΠΈΠ΅ΠΌ ΠΈΠ·ΠΌΠ΅Π½ΡΡΡΠ΅Π³ΠΎΡΡ ΠΏΠ΅ΡΠ΅ΠΏΠ°Π΄Π° Π΄Π°Π²Π»Π΅Π½ΠΈΡ. ΠΠ΅ΠΊΡΠΎΡ ΠΏΠ΅ΡΠ΅ΠΌΠ΅ΡΠ΅Π½ΠΈΠΉ ΡΠΎΡΠ΅ΠΊ ΡΡΠ΅Π΄Ρ ΠΈΠΌΠ΅Π΅Ρ ΡΠΎΠ»ΡΠΊΠΎ Π²Π΅ΡΡΠΈΠΊΠ°Π»ΡΠ½ΡΡ ΠΊΠΎΠΌΠΏΠΎΠ½Π΅Π½ΡΡ. ΠΠ°ΡΠ΅ΠΌΠ°ΡΠΈΡΠ΅ΡΠΊΠ°Ρ ΠΌΠΎΠ΄Π΅Π»Ρ ΠΎΡΠ½ΠΎΠ²Π°Π½Π° Π½Π° ΡΠ΅ΠΎΡΠΈΠΈ Π±ΠΎΠ»ΡΡΠΈΡ
ΡΠΏΡΡΠ³ΠΎΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΈΡ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΉ, Π² ΡΠ°ΠΌΠΊΠ°Ρ
ΠΊΠΎΡΠΎΡΠΎΠΉ ΡΡΠΎΡΠΌΡΠ»ΠΈΡΠΎΠ²Π°Π½Ρ Π΄ΠΈΡΡΠ΅ΡΠ΅Π½ΡΠΈΠ°Π»ΡΠ½ΡΠ΅ ΡΡΠ°Π²Π½Π΅Π½ΠΈΡ ΠΏΠ΅ΡΠ΅Π½ΠΎΡΠ° ΡΠ΅Π½Π·ΠΎΡΠΎΠ² ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΡ
ΠΈ Π½Π΅ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΡ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΉ. ΠΡΠΈ ΡΡΠΎΠΌ Π½Π΅ΠΎΠ±ΡΠ°ΡΠΈΠΌΡΠ΅ Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΈ Π²ΠΊΠ»ΡΡΠ°ΡΡ Π² ΡΠ΅Π±Ρ
Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΈ ΠΏΠΎΠ»Π·ΡΡΠ΅ΡΡΠΈ ΠΈ ΠΏΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠΈΠ΅ Π΄Π΅ΡΠΎΡΠΌΠ°ΡΠΈΠΈ. Π Π΅ΡΠ΅Π½ΠΈΠ΅ Π·Π°Π΄Π°ΡΠΈ ΠΏΠΎΠ»ΡΡΠ΅Π½ΠΎ Ρ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΠ΅ΠΌ Π°Π½Π°Π»ΠΈΡΠΈΡΠ΅ΡΠΊΠΈΡ
ΠΈ ΡΠΈΡΠ»Π΅Π½Π½ΡΡ
ΠΌΠ΅ΡΠΎΠ΄ΠΎΠ². ΠΡΠΎΠ°Π½Π°Π»ΠΈΠ·ΠΈΡΠΎΠ²Π°Π½ΠΎ Π²Π»ΠΈΡΠ½ΠΈΠ΅ ΠΏΠ°ΡΠ°ΠΌΠ΅ΡΡΠΎΠ² ΡΠΏΡΠΎΡΠ½Π΅Π½ΠΈΡ ΠΈ
Π²ΡΠ·ΠΊΠΎΡΡΠΈ Π½Π° Π΄Π΅ΡΠΎΡΠΌΠΈΡΠΎΠ²Π°Π½ΠΈΠ΅ ΡΡΠ΅Π΄